Facial Feature Extraction using Deformable Graphs and Statistical Pattern Matching

Author

  • Jörgen Ahlberg
Abstract

In model-based coding of image sequences containing human faces, e.g., videophone sequences, the detection and location of the face as well as the extraction of facial features from the images are crucial. The facial feature extraction can be regarded as an optimization problem: a search for the optimum adaptation parameters of the model. The optimum is defined as the minimum distance between the extracted face and a face space. There are different approaches to reduce the computational complexity, and here a scheme using deformable graphs and dynamic programming is described. Experiments have been performed with promising results.

I. MODEL-BASED CODING

Since the major application of the techniques described in this document is model-based coding, an introduction to that topic follows here. For more details, see [2, 9, 10, 14].

The basic idea of model-based coding of video sequences is illustrated in Fig. 1. At the encoding side of a visual communication system (typically, a videophone system), the image from the camera is analysed using computer vision techniques, and the relevant object(s), for example a human face, are identified. A general or specific model is then adapted to the object; usually the model is a wireframe describing the 3-D shape of the object.

Instead of transmitting the full image pixel by pixel, or by coefficients describing the waveform of the image, the image is handled as a 2-D projection of 3-D objects in a scene. To achieve this, parameters describing the object(s) are extracted, coded and transmitted. Typical parameters are size, position and shape. To achieve acceptable visual similarity to the original image, the texture of the object is also transmitted. The texture can be compressed by some traditional image coding technique, but specialized techniques that lower the bit-rate considerably for certain applications have recently been published [15, 16].
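
As a rough illustration of treating the image as a 2-D projection of a parameterised 3-D object, the sketch below applies a size and a position parameter to a few wireframe vertices and projects them onto the image plane. The function name `adapt_and_project`, the single uniform scale factor, and the choice between orthographic and simple pinhole projection are assumptions made for this example; the text does not specify a camera model or a parameter layout.

```python
def adapt_and_project(vertices, scale, translation, focal_length=None):
    """Scale and translate 3-D wireframe vertices, then project them to 2-D.

    vertices:      list of (x, y, z) tuples in model coordinates
    scale:         uniform size parameter (assumed; real models may use more)
    translation:   (tx, ty, tz) position parameter
    focal_length:  None for orthographic projection, otherwise a pinhole
                   projection x' = f * X / Z is used (Z must be non-zero)
    """
    tx, ty, tz = translation
    projected = []
    for x, y, z in vertices:
        # Apply the size and position parameters to the model vertex.
        X, Y, Z = scale * x + tx, scale * y + ty, scale * z + tz
        if focal_length is None:
            # Orthographic projection: simply drop the depth coordinate.
            projected.append((X, Y))
        else:
            # Pinhole projection: divide by depth.
            projected.append((focal_length * X / Z, focal_length * Y / Z))
    return projected


# A hypothetical three-vertex "wireframe" used only for illustration.
triangle = [(0, 0, 0), (1, 0, 0), (0, 1, 0)]
image_points = adapt_and_project(triangle, scale=2, translation=(10, 20, 5))
```

Only the handful of numbers in `scale` and `translation` would need to be coded and transmitted here, rather than the pixels covered by the projected shape.
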
At the receiver side of the system, the parameters are decoded and the decoder's model is modified accordingly. The model is then synthesized as a visual object using computer graphics techniques, e.g., a wireframe is shaped according to the shape and size parameters and the texture is mapped onto its surfaces.

For the following images, only parameters describing the change of the model are transmitted. Typically, those parameters tell how to rotate and translate the model, and, in the case of a non-rigid object like a human face, parameters describing the motion of individual vertices of the wireframe are transmitted. This constitutes the largest gain of model-based coding, since the motion parameters can be transmitted at very low bit-rates [1]. Definitions for coding and representation of parameters for model-based coding and animation of human faces are included in the newly set international standard MPEG-4 [12, 13].

Components of a Model-Based Coding System

To encode an image sequence in a model-based scheme, we first need to detect and locate the face. This can be done by, e.g., colour discrimination, detection of elliptical objects using Hough transforms, connectionist/neural network methods, or statistical pattern matching.

[Figure: diagram with the labels Encoder, Decoder, Channel, Model, Model, Original]
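
The abstract defines the optimum as the minimum distance between the extracted face and a face space. A minimal sketch of such a distance is given below, assuming the face space is a linear subspace described by a mean vector and a set of orthonormal basis vectors (as a principal component analysis would produce). The names `distance_from_face_space` and `best_candidate`, and the tiny 3-dimensional vectors, are hypothetical and chosen only for illustration; this is not necessarily the exact measure used in the paper.

```python
import math


def distance_from_face_space(patch, mean, basis):
    """Distance between `patch` and its projection onto the face space.

    patch, mean: flat lists of pixel values of equal length
    basis:       list of orthonormal vectors spanning the face space
                 (orthonormality is assumed, not checked)
    """
    centered = [p - m for p, m in zip(patch, mean)]
    # Coefficient of the centred patch along each basis vector.
    coeffs = [sum(c * b for c, b in zip(centered, vec)) for vec in basis]
    # Reconstruction of the patch from those coefficients.
    recon = [sum(a * vec[i] for a, vec in zip(coeffs, basis))
             for i in range(len(centered))]
    # The residual norm is the part of the patch the face space cannot explain.
    return math.sqrt(sum((c - r) ** 2 for c, r in zip(centered, recon)))


def best_candidate(candidates, mean, basis):
    """Return the key of the candidate patch closest to the face space."""
    return min(candidates,
               key=lambda k: distance_from_face_space(candidates[k], mean, basis))


# Toy face space: the x-y plane inside a 3-D "pixel" space.
mean = [0.0, 0.0, 0.0]
basis = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
candidates = {"a": [1.0, 1.0, 2.0], "b": [5.0, -3.0, 0.5]}
winner = best_candidate(candidates, mean, basis)
```

In a search over adaptation parameters, each parameter setting would yield one candidate patch, and the setting with the smallest distance would be kept. Evaluating every setting exhaustively is expensive, which is the computational cost that the abstract says the deformable-graph scheme aims to reduce.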


Related resources

Gabor feature constrained statistical model for efficient landmark localization and face recognition

Feature extraction and classification using Gabor wavelets have proven to be successful in computer vision and pattern recognition. Gabor feature based Elastic Bunch Graph Matching (EBGM), which demonstrated excellent performance in the FERET evaluation test, has been considered as one of the best algorithms for face recognition due to its robustness against expression, illumination and pose va...


Analysis and Synthesis of Facial Expressions by Feature-Points Tracking and Deformable Model

Facial expression recognition is useful for designing new interactive devices that offer new ways for humans to interact with computer systems. In this paper we develop a facial expression analysis and synthesis system. The analysis part of the system is based on the facial features extracted from facial feature points (FFP) in frontal image sequences. Selected facial feature poi...


Pii: S0031-3203(96)00086-6

We propose an improved method for eye-feature extraction, description, and tracking using deformable templates. Some existing algorithms are exploited to locate the initial position of eye features and then deformable templates are used for extracting and describing the eye features. Rather than using original energy minimization for matching the templates, the region-based approach is propos...


Facial Feature Extraction using Eigenspaces and Deformable Graphs

In model-based coding of image sequences containing human faces, e.g., videophone sequences, the detection and location of the face as well as the extraction of facial features from the images are crucial. The facial feature extraction can be regarded as an optimization problem, searching the optimum adaptation parameters of the model. The optimum is defined as the parameter set describing the fac...


Automatic Face Recognition via Local Directional Patterns

Automatic facial recognition has many potential applications in different areas of human-computer interaction. However, they are not yet fully realized due to the lack of an effective facial feature descriptor. In this paper, we present a new appearance-based feature descriptor, the local directional pattern (LDP), to represent facial geometry and analyze its performance in recognition. An LDP feat...



Publication date: 1999